Teaching the Principles of Statistical Dynamics 

Kingshuk Ghosh and Ken A. Dill

Department of Biophysics, University of California, San Francisco, CA 94143

Mandar M. Inamdar, Effrosyni Seitaridou, and Rob Phillips

Division of Engineering and Applied Science and Kavli Nanoscience Institute,
California Institute of Technology, Pasadena, CA 91125



We describe a simple framework for teaching the principles that underlie the dynamical laws of
transport: Fick's law of diffusion, Fourier's law of heat flow, the Newtonian viscosity law, and the
mass-action laws of chemical kinetics. In analogy with the way that the maximization of entropy over
microstates leads to the Boltzmann law and predictions about equilibria, maximizing a quantity
that E. T. Jaynes called "Caliber" over all the possible microtrajectories leads to these dynamical
laws. The principle of Maximum Caliber also leads to dynamical distribution functions which
characterize the relative probabilities of different microtrajectories. A great source of recent interest
in statistical dynamics has resulted from a new generation of single-particle and single-molecule
experiments which make it possible to observe dynamics one trajectory at a time.

PACS numbers: 51.10.+y, 05.40.-a, 05.70.Ln

I. INTRODUCTION

We describe an approach for teaching the principles that underlie the dynamical laws of transport: of particles
(Fick's law of diffusion), energy (Fourier's law of heat flow), momentum (the Newtonian law for viscosity), and the
mass-action laws of chemical kinetics. Recent experimental advances now allow for studies of forces and flows at the
single-molecule and nanoscale level, representative examples of which may be found in the references. For
example, single-molecule methods have explored the packing of DNA inside viruses and the stretching of DNA and
RNA molecules. Similarly, video microscopy now allows for the analysis of trajectories of individual submicron-size
colloidal particles, and the measurement of single-channel currents has enabled kinetic studies of DNA
translocation through nanopores.

One of the next frontiers in biology is to understand the "small numbers" problem: how does a biological cell
function, given that most of its proteins and nucleotide polymers are present in numbers much smaller than Avogadro's
number? For example, one of the most important molecules, a cell's DNA, occurs in only a single copy. Also, it
is the flow of matter and energy through cells that makes it possible for organisms to maintain a relatively stable
form. Hence, in order to function, cells must always remain far from equilibrium. Thus, many problems of
current interest involve small systems that are out of equilibrium. Our interest here is two-fold: to teach our students
a physical foundation for the phenomenological macroscopic laws, which describe the properties of averaged forces and
flows, and to teach them about the dynamical fluctuations, away from those average values, for systems containing
small numbers of particles.

In this article, we describe a very simple way to teach these principles. We start from the "principle of Maximum
Caliber", first described by E. T. Jaynes. It aims to provide the same type of foundation for dynamics of
many-degree-of-freedom systems that the second law of thermodynamics provides for equilibria of such systems. To illustrate
the principle, we use a slight variant of one of the oldest and simplest models in statistical mechanics, the Dog-Flea
Model, or Two-Urn Model. Courses in dynamics often introduce Fick's law, Fourier's law, and the Newtonian-fluid
model as phenomenological laws, rather than deriving them from some deeper foundation. Here, instead, we describe a
simple unified perspective that we have found useful for teaching these laws from a foundation in statistical dynamics.
In analogy with the role of microstates as a basis for the properties of equilibria, we focus on microtrajectories as the
basis for predicting dynamics. One argument that might be leveled against the kind of framework presented here is that,
in the cases considered here, it is not clear that it leads to anything different from what one obtains using conventional
nonequilibrium thinking. On the other hand, restating the same physical result in different language can often provide
a better starting point for subsequent reasoning. This point was well articulated by Feynman in his Nobel lecture,
who noted: "Theories of the known, which are described by different physical ideas may be equivalent in all their
predictions and are hence scientifically indistinguishable. However, they are not psychologically identical when trying
to move from that base into the unknown. For different views suggest different kinds of modifications which might be
made and hence are not equivalent in the hypotheses one generates from them in one's attempt to understand what
is not yet understood."

We begin with the main principle embodied in Fick's Law. Why do particles and molecules in solution flow from
regions of high concentration toward regions of low concentration? To keep it simple, we consider one-dimensional
diffusion along a coordinate $x$. This is described by Fick's first law of particle transport, which says that the average
flux, $\langle J \rangle$, is given in terms of the gradient of average concentration, $d\langle c \rangle/dx$, by

$$\langle J \rangle = -D \frac{d\langle c \rangle}{dx}, \tag{1}$$

where $D$ is the diffusion coefficient. In order to clearly distinguish quantities that are dynamical averages from those
that are not, we indicate the averaged quantities explicitly by angle brackets, $\langle \ldots \rangle$. We describe the nature of this
averaging below, and the nature of the dynamical distribution functions over which the averages are taken. But first, we
briefly review the standard derivation of the diffusion equation. Combining Fick's first law with particle conservation,

$$\frac{\partial \langle c \rangle}{\partial t} = -\frac{\partial \langle J \rangle}{\partial x}, \tag{2}$$

gives Fick's second law, also known as the diffusion equation:

$$\frac{\partial \langle c \rangle}{\partial t} = D \frac{\partial^2 \langle c \rangle}{\partial x^2}. \tag{3}$$

Solving Eq. (3) subject to two boundary conditions and one initial condition gives both $\langle c(x,t) \rangle$, the average
concentration in time and space, and the average flux $\langle J(x,t) \rangle$, when no other forces are present. The generalization to
situations involving additional applied forces is the Smoluchowski equation.

A simple experiment shows the distinction between averaged quantities and individual microscopic realizations.
Using a microfluidics chip like that shown in Fig. 1, it is possible to create a small fluid chamber divided into
two regions by control valves. The chamber is filled on one side with a solution containing a small concentration
of micron-scale colloidal particles. The other region contains just water. The three control valves on top of that
microfluidic chamber serve two purposes: the two outer ones are used for isolation so that no particles can diffuse
in and out of the chamber, while the middle control valve provides the partition between the two regions. The time
evolution of the system is then monitored after the removal of the partition (see the accompanying figure). The time-dependent
particle density is determined by dividing the chamber into a number of equal-sized boxes along the long direction
and by computing histograms of the numbers of particles in each slice as a function of time. This is a colloidal solution
analog of the gas diffusion experiments of classical thermodynamics. The corresponding theoretical model usually
used is the diffusion equation. The accompanying figure also shows the solution to the diffusion equation, as a function
of time, for the geometry of the microfluidics chip. The initial condition is a step function in concentration at
$x = 200\ \mu\text{m}$ at time $t = 0$.
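The smooth theoretical curves described above can be reproduced with a few lines of code. The following is a minimal sketch, not the calculation used for the figure: it assumes a 400 um chamber split into 40 boxes, an illustrative diffusion coefficient `D`, closed ends (no flux through the outer valves), and a simple explicit finite-difference scheme. All numerical values and the function name `diffuse` are hypothetical.

```python
def diffuse(c, D, dx, dt, steps):
    """Explicit finite-difference update of dc/dt = D d2c/dx2 with closed ends."""
    if D * dt / dx**2 > 0.5:
        raise ValueError("explicit scheme needs D*dt/dx**2 <= 1/2")
    c = list(c)
    for _ in range(steps):
        # Fick's first law on each interior face: J = -D dc/dx
        J = [-D * (c[i + 1] - c[i]) / dx for i in range(len(c) - 1)]
        # No particles cross the outer valves: J = 0 at both walls
        J = [0.0] + J + [0.0]
        # Particle conservation: dc/dt = -dJ/dx
        c = [c[i] - dt * (J[i + 1] - J[i]) / dx for i in range(len(c))]
    return c

dx = 10.0   # box width in um (assumed)
D = 0.5     # diffusion coefficient in um^2/s (assumed)
dt = 50.0   # time step in s, so that D*dt/dx**2 = 0.25
c0 = [1.0] * 20 + [0.0] * 20          # step in concentration at x = 200 um

c_early = diffuse(c0, D, dx, dt, 10)    # shortly after the partition is removed
c_late = diffuse(c0, D, dx, dt, 5000)   # long times: nearly uniform
print([round(v, 2) for v in c_early[15:25]])
```

The update conserves the total particle number by construction, and the profile relaxes from the initial step toward a uniform concentration, which is the averaged behavior against which the fluctuating experimental histograms are compared.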

We use this simple experiment to illustrate one main point. The key distinction is that the theoretical curves are
very smooth, while there are very large fluctuations in the experimentally observed dynamics of the particle densities.
The fluctuations are large because the number of colloidal particles is small, tens to hundreds. The experimental data
show that the particle concentration $c(x,t)$ is a highly fluctuating quantity. It is not well described by the standard
smoothed curves that are calculated from the diffusion equation. Of course, when averaged over many trajectories
or when particles are at high concentrations, the experimental data should approach the smoothed curves that are
predicted by the classical diffusion equation. Our aim here is to derive Fick's Law and other phenomenological
transport relations at a microscopic level, so that we can consider both the behaviors of average properties and the
fluctuations, i.e., the dynamical distribution functions, and to illustrate the Maximum Caliber approach.



II. THE EQUILIBRIUM PRINCIPLE OF MAXIMUM ENTROPY 

We are interested here in dynamics, not statics. However, our strategy follows so closely the Jaynes derivation of
the Boltzmann distribution law of equilibrium statistical mechanics that we first show the equilibrium treatment.
To derive the Boltzmann law, we start with a set of equilibrium microstates $i = 1, 2, 3, \ldots, N$ that are relevant to the
problem at hand. We aim to compute the probabilities $p_i$ of those microstates in equilibrium. We define the entropy,
$S$, of the system as

$$S(\{p_i\}) = -k_B \sum_{i=1}^{N} p_i \ln p_i, \tag{4}$$

where $k_B$ is the Boltzmann constant. The equilibrium probabilities, $p_i = p_i^*$, are those values of $p_i$ that cause the
entropy to be maximal, subject to two constraints:

$$\sum_{i=1}^{N} p_i = 1, \tag{5}$$

which is a normalization condition that ensures that the probabilities $p_i$ sum to one, and

$$\langle E \rangle = \sum_i p_i E_i, \tag{6}$$

which says that the energies, when averaged over all the microstates, sum to the macroscopically observable average
energy.

Using Lagrange multipliers $\lambda$ and $\beta$ to enforce the first and second constraints, respectively, leads to an expression
for the values $p_i^*$ that maximize the entropy:

$$\sum_i \left[ -1 - \ln p_i^* - \lambda - \beta E_i \right] = 0. \tag{7}$$

The result is that

$$p_i^* = \frac{e^{-\beta E_i}}{Z}, \tag{8}$$

where $Z$ is the partition function, defined by

$$Z = \sum_i e^{-\beta E_i}. \tag{9}$$

After a few thermodynamic arguments, the Lagrange multiplier $\beta$ can be shown to be equal to $1/k_B T$. This
derivation, first given in this simple form by Jaynes, identifies the probabilities that are both consistent with the
observable average energy and that otherwise maximize the entropy. Jaynes justified this strategy on the grounds that
it would be the best prediction that an observer could make, given the observable, if the observer is ignorant of all else.
In this case, the observable is the average energy. While this derivation of the Boltzmann law is now quite popular,
its interpretation as a method of prediction, rather than as a method of physics, is controversial. Nevertheless, for
our purposes here, it does not matter whether we regard this as a description of physical systems or as a strategy for
making predictions.
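The maximum-entropy recipe above can be checked numerically. The sketch below is only an illustration under stated assumptions: the three energy levels, the target average energy of 0.8, and the helper names `boltzmann`, `entropy`, and `solve_beta` are all hypothetical. It finds the multiplier $\beta$ that reproduces the target average energy, then confirms that a rival distribution obeying the same two constraints has lower entropy.

```python
import math

def boltzmann(energies, beta):
    """p_i = exp(-beta * E_i) / Z for a list of microstate energies."""
    weights = [math.exp(-beta * E) for E in energies]
    Z = sum(weights)                      # partition function
    return [w / Z for w in weights]

def entropy(p):
    """S / k_B = -sum_i p_i ln p_i."""
    return -sum(x * math.log(x) for x in p if x > 0)

def solve_beta(energies, E_avg, lo=-50.0, hi=50.0):
    """Bisect for the beta whose Boltzmann average energy equals E_avg."""
    def mean_E(beta):
        p = boltzmann(energies, beta)
        return sum(pi * E for pi, E in zip(p, energies))
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        # The average energy decreases monotonically as beta increases.
        if mean_E(mid) > E_avg:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

energies = [0.0, 1.0, 2.0]     # hypothetical three-level system
beta = solve_beta(energies, E_avg=0.8)
p_star = boltzmann(energies, beta)

# A rival distribution with the same normalization and the same <E>:
# moving eps out of level 1 and splitting it between levels 0 and 2
# changes neither sum(p) nor sum(p * E).
eps = 0.02
p_rival = [p_star[0] + eps / 2, p_star[1] - eps, p_star[2] + eps / 2]

print(entropy(p_star) > entropy(p_rival))
```

Any other perturbation that preserves both constraints would serve equally well; the Boltzmann distribution is the unique maximizer because the entropy is strictly concave.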

Now, we switch from equilibrium to dynamics, but we use a similar strategy. We switch from the Principle of
Maximum Entropy to what Jaynes called the Principle of Maximum Caliber. In particular, rather than focusing on
the probability distribution $p(E_i)$ for the various microstates, we seek $p[\{\sigma_i(t)\}]$, where $\sigma_i(t)$ is the $i$th microscopic
trajectory of the system. Again we maximize an entropy-like quantity, obtained from $p[\{\sigma_i(t)\}]$, to obtain the predicted
distribution of microtrajectories. If there are no constraints, this maximization results in the prediction that all the
possible microtrajectories are equally likely during the dynamical process. In contrast, certain microtrajectories will
be favored if there are dynamical constraints, such as may be specified in terms of the average flux.

In the following section, we use the Maximum Caliber strategy to derive Fick's Law using the Dog-Flea model,
which is one of the simplest models that contain the physics of interest.



III. FICK'S LAW FROM THE DOG-FLEA MODEL 



We want to determine the diffusive time evolution of particles in a one-dimensional system. The key features of this
system are revealed by considering two columns of particles separated by a plane, as shown in the accompanying figure.
The left-hand column (1) has $N_1(t)$ particles at time $t$ and the right-hand column (2) has $N_2(t)$ particles. This is a
simple variant of the famous "Dog-Flea" model of the Ehrenfests, introduced in 1907. Column (1) corresponds to Dog (1), which
has $N_1$ fleas on its back at time $t$, and column (2) corresponds to Dog (2), which has $N_2$ fleas at time $t$. In any time
interval between time $t$ and $t + \Delta t$, any flea can either stay on its current dog, or it can jump to the other dog. This
model has been used extensively to study the Boltzmann H-theorem and to understand how the time asymmetry of
diffusion processes arises from an underlying time symmetry in the laws of motion. Our model is used for a
slightly different purpose. In particular, our aim is to take a well-characterized problem like diffusion and to reveal
how the Principle of Maximum Caliber may be used in a concrete way. We follow the conventional definition of flux
as the number of particles transferred per unit time and per unit area. For simplicity, we take the cross-sectional area
to be unity.

First, consider the equilibrium state of our Dog-Flea model. The total number of ways of partitioning the $(N_1 + N_2)$
fleas (particles) is $W(N_1, N_2)$,

$$W(N_1, N_2) = \frac{(N_1 + N_2)!}{N_1! \, N_2!}. \tag{10}$$

The state of equilibrium is that for which the entropy, $S = k_B \ln W$, is maximal. A simple calculation shows that the
entropy is maximal when $N_1$ is as nearly equal to $N_2$ as possible. In short, at equilibrium, both
dogs will have approximately the same number of fleas, in the absence of any bias.
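The "simple calculation" can be done by direct enumeration. As a small illustration, assuming a total of six fleas (the value `N = 6` is chosen here only because it matches the worked examples later in the text), the multiplicity of each partition is a binomial coefficient:

```python
from math import comb, log

N = 6  # total number of fleas (illustrative choice)

# W(N1, N2) = N! / (N1! N2!) with N2 = N - N1
W = {N1: comb(N, N1) for N1 in range(N + 1)}
S_over_kB = {N1: log(w) for N1, w in W.items()}   # S / k_B = ln W

N1_eq = max(W, key=W.get)   # partition with the largest multiplicity
print(W)                    # {0: 1, 1: 6, 2: 15, 3: 20, 4: 15, 5: 6, 6: 1}
print(N1_eq)                # 3
```

The multiplicity, and hence the entropy, peaks at the even split $N_1 = N_2 = 3$.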

Our focus here is on the dynamics, that is, on how the system reaches that state of equilibrium. We discretize time into a
series of intervals $\Delta t$. We define a dynamical quantity $p$, which is the probability that a particle (flea) jumps from one
column (dog) to the other in any time interval $\Delta t$. Thus, the probability that a flea stays on its dog during that time
interval is $q = 1 - p$. We assume that $p$ is independent of time $t$, and that all the fleas and jumps are independent of
each other.

In equilibrium statistical mechanics, the focus is on the microstates. However, for dynamics we focus on processes,
which, at the microscopic level, we call microtrajectories. Characterizing the dynamics requires more than just
information about the microstates; we must also consider the processes. Let $m_1$ represent the number of particles
that jump from column (1) to (2), and $m_2$ the number of particles that jump from column (2) to (1), between time
$t$ and $t + \Delta t$. There are many possible different values of $m_1$ and $m_2$: it is possible that no fleas will jump during the
interval $\Delta t$, or that all the fleas will jump, or that the number of fleas jumping will be in between those limits. Each
one of these different situations corresponds to a distinct microtrajectory of the system in this idealized dynamical
model. We need a principle to tell us what number of fleas will jump during the time interval $\Delta t$ at time $t$. Because
the dynamics of this model is so simple, the implementation of the caliber idea is reduced to nothing more than a
simple exercise in enumeration and counting using the binomial distribution.

A. The Dynamical Principle of Maximum Caliber 

The probability, $W_d(m_1, m_2 \mid N_1, N_2)$, that $m_1$ particles jump to the right and that $m_2$ particles jump to the left in
a discrete unit of time $\Delta t$, given that there are $N_1(t)$ and $N_2(t)$ fleas on the dogs at time $t$, is

$$W_d(m_1, m_2 \mid N_1(t), N_2(t)) = \left[ \frac{N_1!}{m_1! (N_1 - m_1)!} \, p^{m_1} q^{N_1 - m_1} \right] \left[ \frac{N_2!}{m_2! (N_2 - m_2)!} \, p^{m_2} q^{N_2 - m_2} \right]. \tag{11}$$




$W_d$ is a count of microtrajectories in dynamics problems in the same vein that $W$ counts microstates for equilibrium
problems. In the same spirit that the Second Law of Thermodynamics says to maximize $W$ to predict states of
equilibrium, now for dynamics, we maximize $W_d$ over all the possible microtrajectories (i.e., over $m_1$ and $m_2$) to
predict the fluxes of fleas between the dogs. This is the implementation of the Principle of Maximum Caliber within
this simple model. Maximizing $W_d$ over all the possible processes (different values of $m_1$ and $m_2$) gives our prediction
(right flux $m_1 = m_1^*$ and left flux $m_2 = m_2^*$) for the macroscopic flux that we should observe in experiments.

Since the jumps of the fleas from each dog are independent, we find our predicted macroscopic dynamics by
maximizing $W_{d1}$ and $W_{d2}$ (the two bracketed factors of $W_d$) separately, or for convenience their logarithms:

$$\left. \frac{\partial \ln W_{di}}{\partial m_i} \right|_{N_i,\; m_i = m_i^*} = 0, \qquad i = 1, 2. \tag{12}$$

Note that applying Stirling's approximation to $W_{di}$ gives:

$$\ln W_{di} \approx m_i \ln p + (N_i - m_i) \ln q + N_i \ln N_i - m_i \ln m_i - (N_i - m_i) \ln (N_i - m_i). \tag{13}$$

We call $C = \ln W_d$ the caliber. Maximizing $C$ with respect to $m_i$ gives

$$\left. \frac{\partial \ln W_{di}}{\partial m_i} \right|_{m_i = m_i^*} = \ln p - \ln q - \ln m_i^* + \ln (N_i - m_i^*) = 0. \tag{14}$$

This result may be simplified to yield

$$\frac{m_i^*}{N_i - m_i^*} = \frac{p}{q}, \tag{15}$$

which implies that the most probable jump number is simply given by

$$m_i^* = p N_i. \tag{16}$$
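The result $m_i^* = pN_i$ can also be confirmed without Stirling's approximation by scanning the exact binomial weight for its maximum. This is a minimal sketch; the values `N1 = 100` and `p = 0.1` and the function name `W_d1` are illustrative choices, not taken from the derivation above.

```python
from math import comb

def W_d1(m, N, p):
    """Exact weight for m of the N fleas on one dog jumping in one time step."""
    return comb(N, m) * p**m * (1 - p)**(N - m)

N1, p = 100, 0.1
m_star = max(range(N1 + 1), key=lambda m: W_d1(m, N1, p))
print(m_star, p * N1)   # most probable jump number vs. p * N1
```

For these values the scan returns `m_star = 10`, in agreement with $pN_1 = 10$.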

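But since our probability distribution $W_d$ is nearly symmetric about the most probable value of flux, the average
number and the most probable number are approximately the same. Hence, the average net flux to the right will be

$$\langle J(t) \rangle = \frac{m_1^* - m_2^*}{\Delta t} = p \, \frac{N_1(t) - N_2(t)}{\Delta t} = -\frac{p \Delta x^2}{\Delta t} \frac{\Delta c(x,t)}{\Delta x},$$

which is Fick's law, in this simple model, and where the diffusion constant is given by $D = p \Delta x^2/\Delta t$. Here each
column has width $\Delta x$ and unit cross-sectional area, so its concentration is $c = N/\Delta x$, and we have
rewritten $N_1 - N_2 = -\Delta c \, \Delta x$.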

Hence we have a simple explanation for why there is a net flux of particles diffusing across a plane down a
concentration gradient: more microscopic trajectories lead downhill than uphill. It shows that the diffusion constant $D$ is a
measure of the jump rate $p$. This simple model does not make any assumptions that the system is "near-equilibrium",
i.e., utilizing the Boltzmann distribution law, for example, and thus it indicates that Fick's Law ought also to apply
far from equilibrium. For example, we could have imagined that for very steep gradients, Fick's Law might have been
only an approximation and that diffusion is more accurately represented as a series expansion of higher derivatives of
the gradient. But, at least within the present model, Fick's Law is a general result which emerges from counting up
microtrajectories. On the other hand, we would expect Fick's law to break down when the particle density becomes so
high that the particles start interacting with each other, thus spoiling the assumption of independent particle jumps.
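A direct simulation of the two-column model makes the same point. The sketch below is an illustration only: the occupancies, jump probability, sample count, and the helper name `step_flux` are assumed for the example, and the random seed is fixed so the run is repeatable. Each flea jumps independently with probability `p`, and the sampled mean net flux per time step is compared with $p(N_1 - N_2)$.

```python
import random

def step_flux(N1, N2, p, rng):
    """Net number of fleas moving from dog 1 to dog 2 in one time step."""
    m1 = sum(rng.random() < p for _ in range(N1))   # jumps 1 -> 2
    m2 = sum(rng.random() < p for _ in range(N2))   # jumps 2 -> 1
    return m1 - m2

rng = random.Random(0)          # fixed seed for repeatability
N1, N2, p = 60, 20, 0.2         # illustrative values
samples = [step_flux(N1, N2, p, rng) for _ in range(20000)]

mean_J = sum(samples) / len(samples)
predicted = p * (N1 - N2)       # average flux per step from the model
frac_backward = sum(J < 0 for J in samples) / len(samples)

print(round(mean_J, 2), predicted, frac_backward)
```

The sampled mean sits close to the predicted value, while individual steps scatter around it; a small fraction of steps may even carry net flux against the gradient.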

B. Fluctuations in Diffusion 

Above, we have shown that the most probable number of fleas that jump from dog (1) to dog (2) between time $t$
and $t + \Delta t$ is $m_1^* = pN_1(t)$. The model also tells us that sometimes we will have fewer fleas jumping during that time
interval, and sometimes we will have more fleas. These variations are a reflection of the fluctuations resulting from
the system following different microscopic pathways.

We focus now on predicting the fluctuations. To illustrate, let us first make up a table of $W_d$, the different numbers
of possible microtrajectories, taken over all the values of $m_1$ and $m_2$ (Table I). To keep this illustration simple, let
us consider the following particular case: $N_1(t) = 4$ and $N_2(t) = 2$. Let us also assume $p = q = 1/2$. Here, then, are
the multiplicities of all the possible routes of flea flow. A given entry tells how many microtrajectories correspond to
the given choice of $m_1$ and $m_2$.

Notice first that the table confirms our previous discussion. The dynamical process for which $W_d$ is maximal (12
microtrajectories, in this case) occurs when $m_1^* = pN_1 = 1/2 \times 4 = 2$ and $m_2^* = pN_2 = 1/2 \times 2 = 1$. You can
compute the probability of that particular flux by dividing $W_d = 12$ by the sum of entries in this table, which is
$2^6 = 64$, the total number of microtrajectories. The result, which is the fraction of all the possible microtrajectories
that have $m_1 = 2$ and $m_2 = 1$, is $12/64 \approx 0.19$. We have chosen an example in which the particle numbers are very small,
so the fluctuations are large; they account for more than 80 percent of the flow. In systems having large numbers of
particles, the relative fluctuations are much smaller than this.
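The multiplicities discussed here are easy to regenerate. The following sketch enumerates every $(m_1, m_2)$ pair for $N_1 = 4$, $N_2 = 2$ with $p = q = 1/2$, where each microtrajectory is equally likely and the entries are products of binomial coefficients. The printed layout (rows indexed by $m_1$, columns by $m_2$) is an assumption chosen so that "top right" and "bottom left" match the description in the text.

```python
from math import comb

N1, N2 = 4, 2

# Number of microtrajectories with m1 jumps to the right and m2 to the left
table = {(m1, m2): comb(N1, m1) * comb(N2, m2)
         for m1 in range(N1 + 1) for m2 in range(N2 + 1)}

print("m1\\m2 " + " ".join(f"{m2:>3}" for m2 in range(N2 + 1)))
for m1 in range(N1 + 1):
    print(f"{m1:>5} " + " ".join(f"{table[(m1, m2)]:>3}" for m2 in range(N2 + 1)))

total = sum(table.values())          # 2**6 = 64
best = max(table, key=table.get)     # (m1, m2) with the most microtrajectories
prob_best = table[best] / total      # 12 / 64 = 0.1875
print(total, best, prob_best)
```

The rows come out as (1, 2, 1), (4, 8, 4), (6, 12, 6), (4, 8, 4), and (1, 2, 1), with the single largest entry, 12, at $(m_1, m_2) = (2, 1)$.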

Now look at the top right corner of this table. This entry says that there is a probability of 1/64 that both fleas on
dog (2) will jump to the left while no fleas will jump to the right, implying that the net flux, for that microtrajectory,
is actually backwards, relative to the concentration gradient. We call these "bad actor" microtrajectories. In those
cases, particles flow to increase the concentration gradient, not decrease it. Traditionally, "Maxwell's Demon" was an
imaginary microscopic being that was invoked in similar situations in heat flow processes, i.e., where heat would flow
from a colder object to heat up a hotter one, albeit with low probability. In particular, the Demon was supposed
to capture the bad actor microtrajectories. At time $t$, there are 4 fleas on the left dog, and 2 on the right. At the
next instant in time, $t + \Delta t$, all 6 fleas are on the left dog, and no fleas are on the right-hand dog. Notice that this is
not a violation of the Second Law, which is a tendency towards maximum entropy, because the Second Law is only a
statement that the average flow must increase the entropy; it says nothing about the fluctuations.

Similarly, if you look at the bottom left corner of the table, you see a regime of "superflux": a net flux of 4 particles
to the right, whereas Fick's Law predicts a net flow of only 1 particle to the right ($m_1^* - m_2^* = 2 - 1$). This table
illustrates that Fick's Law is only a description of the average or most probable flow, and it shows that Fick's Law is
not always exactly correct at the microscopic level. However, such violations of Fick's Law are of low probability, a
point that we will make more quantitative below. Such fluctuations have been experimentally measured in small systems.
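To put numbers on "low probability" for this six-flea example, one can collapse the table of microtrajectories onto the net flux $J = m_1 - m_2$. This is a minimal sketch for the same case ($N_1 = 4$, $N_2 = 2$, $p = q = 1/2$); the variable names are illustrative.

```python
from collections import Counter
from math import comb

N1, N2 = 4, 2

# Count microtrajectories by their net flux J = m1 - m2
flux_counts = Counter()
for m1 in range(N1 + 1):
    for m2 in range(N2 + 1):
        flux_counts[m1 - m2] += comb(N1, m1) * comb(N2, m2)

total = sum(flux_counts.values())
p_backward = sum(w for J, w in flux_counts.items() if J < 0) / total
p_superflux = flux_counts[4] / total     # all 4 fleas jump right, none left

print(dict(sorted(flux_counts.items())))
print(p_backward, p_superflux)
```

The counts for $J = -2, \ldots, 4$ are 1, 6, 15, 20, 15, 6, 1, so 7 of the 64 microtrajectories carry net flux against the gradient and only 1 of 64 gives the maximal flux of 4.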

We can further elaborate on the fluctuations by defining the "potencies" of the microtrajectories. We define the
potency to be the fraction of all the trajectories that lead to a substantial change in the macrostate. The potencies of
trajectories depend upon how far the system is from equilibrium. To see this, let us continue with our simple system
having 6 particles. The total number of microscopic trajectories available to this system at each instant in our discrete
time picture is $2^6 = 64$. Suppose that at $t = 0$ all 6 of these particles are on dog (1). The total number of microscopic
trajectories available to the system can be classified once again using $m_1$ and $m_2$, where in this case $m_2 = 0$ since
there are no fleas on dog (2) (see Table II).

What fraction of all microtrajectories changes the occupancies of both dogs by more than some threshold value,
say $|\Delta N_1| > 1$? In this case, we find that 57 of the 64 microtrajectories cause a change greater than this to the current
state. We call these potent trajectories.

Now, let us look at the potencies of the same system of 6 particles in a different situation, $N_1 = N_2 = 3$, when
the system is in macroscopic equilibrium (see the corresponding table). Here, only the trajectories with $(m_1, m_2)$ pairs
given by (0, 2), (0, 3), (1, 3), (2, 0), (3, 0), and (3, 1) satisfy our criterion. Summing over all of these outcomes shows
that just 14 of the 64 trajectories are potent in this case.
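Both counts can be reproduced by enumeration. In the sketch below, a microtrajectory is counted as potent when $|m_1 - m_2|$ exceeds the threshold, since the change in occupancy of dog (1) is $\Delta N_1 = m_2 - m_1$; the function name `potent_count` is introduced here only for illustration.

```python
from math import comb

def potent_count(N1, N2, threshold=1):
    """Number of microtrajectories that change N1 by more than `threshold`."""
    return sum(comb(N1, m1) * comb(N2, m2)
               for m1 in range(N1 + 1)
               for m2 in range(N2 + 1)
               if abs(m1 - m2) > threshold)

far_from_equilibrium = potent_count(6, 0)   # all six fleas on dog (1)
at_equilibrium = potent_count(3, 3)         # three fleas on each dog

print(far_from_equilibrium, at_equilibrium)  # 57 14
```

Far from equilibrium, 57 of the 64 microtrajectories are potent; at equilibrium, only 14 are.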

There are two key observations conveyed by these arguments. First, for a system far from equilibrium, the vast
majority of trajectories at that time $t$ are potent, and move the system significantly away from its current macrostate.
Second, when the system is near equilibrium, the vast majority of microtrajectories leave the macrostate unchanged.
Let us now generalize from the tables above, to see when fluctuations will be important.



1. Fluctuations and Potencies 



A simple way to characterize the magnitude of the fluctuations is to look at the width of the $W_d$ distribution.
It is shown in standard statistics texts that for a binomial distribution such as ours, for which the mean and most
probable value both equal $m_i^* = pN_i$, the variance is $\sigma_i^2 = N_i pq$. The variance characterizes the width. Moreover, if
$N_i$ is sufficiently large, a binomial distribution can be well-approximated by a Gaussian distribution

$$\mathcal{P}(m_i, N_i) = \frac{1}{\sqrt{2\pi N_i pq}} \exp\left( -\frac{(m_i - pN_i)^2}{2 N_i pq} \right), \tag{17}$$

an approximation we find convenient since it leads to simple analytic results. However, this distribution function is
not quite the one we want. We are interested in the distribution of flux, $P(J) = P(m_1 - m_2)$, not the distribution of
right-jumps $m_1$ or left-jumps $m_2$ alone, $\mathcal{P}(m)$.

However, due to a remarkable property of the Gaussian distribution, it is simple to compute the quantity we want.
If you have two independent Gaussian-distributed variables, one with mean $\langle x_1 \rangle$ and variance $\sigma_1^2$, and the other with
mean $\langle x_2 \rangle$ and variance $\sigma_2^2$, then the distribution function $P(x_1 - x_2)$ for the difference will also be a Gaussian
distribution, having mean $\langle x_1 \rangle - \langle x_2 \rangle$ and variance $\sigma^2 = \sigma_1^2 + \sigma_2^2$.

For our binomial distributions, the means are $m_1^* = pN_1$ and $m_2^* = pN_2$ and the variances are $\sigma_1^2 = N_1 pq$ and
$\sigma_2^2 = N_2 pq$, so the distribution of the net flux, $J = m_1 - m_2$, is

$$P(J) = \frac{1}{\sqrt{2\pi pqN}} \exp\left( -\frac{(J - p(N_1 - N_2))^2}{2pqN} \right), \tag{18}$$

where $N = N_1 + N_2$.
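The quality of the Gaussian form of Eq. (18) can be checked against the exact flux distribution, obtained by convolving the two binomial distributions. The sketch below uses illustrative values ($N_1 = 60$, $N_2 = 40$, $p = 0.3$) and hypothetical helper names; it compares the exact mean and variance with $p(N_1 - N_2)$ and $Npq$, and records the largest pointwise difference between the exact and Gaussian probabilities.

```python
from math import comb, exp, pi, sqrt

def flux_distribution(N1, N2, p):
    """Exact P(J) for J = m1 - m2 with independent binomial m1 and m2."""
    q = 1 - p
    P = {}
    for m1 in range(N1 + 1):
        w1 = comb(N1, m1) * p**m1 * q**(N1 - m1)
        for m2 in range(N2 + 1):
            w2 = comb(N2, m2) * p**m2 * q**(N2 - m2)
            P[m1 - m2] = P.get(m1 - m2, 0.0) + w1 * w2
    return P

def gaussian_flux(J, N1, N2, p):
    """Gaussian approximation of Eq. (18)."""
    N, q = N1 + N2, 1 - p
    return exp(-(J - p * (N1 - N2))**2 / (2 * p * q * N)) / sqrt(2 * pi * p * q * N)

N1, N2, p = 60, 40, 0.3
P = flux_distribution(N1, N2, p)

mean = sum(J * w for J, w in P.items())
var = sum((J - mean)**2 * w for J, w in P.items())
max_err = max(abs(w - gaussian_flux(J, N1, N2, p)) for J, w in P.items())

print(round(mean, 6), round(var, 6), max_err)
```

The exact mean and variance equal $p(N_1 - N_2) = 6$ and $Npq = 21$ for these values; only the shape of the distribution, not its first two moments, is approximated by the Gaussian.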

The corresponding figure shows an example of the distributions of fluxes at different times, using $p = 0.1$, and starting
from $N_1 = 100$, $N_2 = 0$. We update each time step using an averaging scheme, $N_1(t + \Delta t) = N_1(t) - N_1(t)p + N_2(t)p$. The
figure shows how the mean flux is large at first and decays towards equilibrium, $J = 0$. This result could also have
been predicted from the diffusion equation. However, equally interesting are the wings of the distributions, which
show the deviations from the average flux, and these are not predictable from the diffusion equation. One measure
of the importance of the fluctuations in a given dynamical problem is the ratio of the standard deviation $\sigma$ to the
mean flux $\langle J \rangle$:

$$\frac{\sigma}{\langle J \rangle} = \frac{\sqrt{Npq}}{(N_1 - N_2)p}. \tag{19}$$

In the limit of large $N_1$, the above reduces to

$$\frac{\sigma}{\langle J \rangle} = \frac{\sqrt{Npq}}{(N_1 - N_2)p} \sim N^{-1/2}. \tag{20}$$
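The averaging scheme described above can be iterated directly. The following sketch uses the stated values $p = 0.1$, $N_1 = 100$, $N_2 = 0$; the number of steps and the function name `relax` are illustrative assumptions. At each step it records the mean flux $p(N_1 - N_2)$ and the width $\sqrt{Npq}$ of the flux distribution from Eq. (18).

```python
def relax(N1, N2, p, steps):
    """Iterate the mean occupancies and record (N1, N2, mean flux, width)."""
    history = []
    for _ in range(steps):
        mean_J = p * (N1 - N2)
        sigma = (p * (1 - p) * (N1 + N2)) ** 0.5
        history.append((N1, N2, mean_J, sigma))
        # Averaging update: each dog loses a fraction p and gains p of the other
        N1, N2 = N1 - N1 * p + N2 * p, N2 - N2 * p + N1 * p
    return history

history = relax(100.0, 0.0, 0.1, 50)
for step in (0, 5, 20):
    N1, N2, mean_J, sigma = history[step]
    print(step, round(N1, 2), round(N2, 2), round(mean_J, 3), round(sigma, 3))
```

Because $N_1 - N_2$ shrinks by a factor $(1 - 2p)$ per step, the mean flux decays geometrically while the width stays fixed at $\sqrt{Npq} = 3$, so the ratio in Eq. (19) grows as the system approaches equilibrium.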

In a typical bulk experiment, particle numbers are large, of the order of Avogadro's number, $\sim 10^{23}$. In such cases, the
width of the flux distribution is exceedingly small and it becomes overwhelmingly probable that the mean flux will
be governed by Fick's law. However, within biological cells and in applications involving small numbers of particles,
the variance of the flux can become significant. It has been observed that both rotary and translational single motor
proteins sometimes transiently step backwards, relative to their main direction of motion.

As a measure of the fluctuations, we now calculate the variance in the flux. It follows from Eq. (18) that
$\langle (\Delta J)^2 \rangle = Npq$, where $N = N_1 + N_2$. Thus, we can represent the magnitude of the fluctuations as $\delta$,

$$\delta = \sqrt{\frac{\langle (\Delta J)^2 \rangle}{\langle J \rangle^2}} = \frac{\sqrt{Npq}}{pfN} = \frac{1}{f} \sqrt{\frac{q}{Np}},$$

where the quantity $N = N_1 + N_2$ is the total number of fleas and $f = (N_1 - N_2)/N$ is the normalized concentration
difference. The quantity $\delta$ is also a measure of the degree of backflux. In the limit of large $N$, $\delta$ goes to zero. That is,
the noise diminishes with system size. However, even when $N$ is large, $\delta$ can still be large (indicating the possibility
of backflux) if the concentration gradient, $N_1 - N_2$, is small.

Let us look now at our other measure of fluctuations, the potency. Trajectories that are not potent should have
$|m_1 - m_2| \approx 0$, which corresponds to a negligible change in the current state of the system as a result of a given
microtrajectory. In the corresponding figure, the impotent microtrajectories are shown as the shaded band for which
$m_1 \approx m_2$. To quantify this, we define impotent trajectories as those for which $|m_1 - m_2| < h$ ($h \ll N$). In the
Gaussian model, the fraction of impotent trajectories is

$$\Phi_{\text{impotent}} = \int_{-h}^{h} \frac{dJ}{\sqrt{2\pi Npq}} \exp\left[ -\frac{(J - (N_1 - N_2)p)^2}{2Npq} \right] \tag{21}$$

$$= \frac{1}{2} \left[ \mathrm{erf}\left( \frac{h + (N_1 - N_2)p}{\sqrt{2Npq}} \right) + \mathrm{erf}\left( \frac{h - (N_1 - N_2)p}{\sqrt{2Npq}} \right) \right], \tag{22}$$


and corresponds to summing over the subset of trajectories that have a small flux. To keep it simple, we did a
computation, taking $p = q = 1/2$, for which the expression for the probability distribution for the microscopic
flux $m_1 - m_2$ is given by Eq. (18). The choice of $h$ is arbitrary, so let us just choose $h$ to be one standard deviation,
$\sqrt{N/4}$. The corresponding figure shows potencies for various values of $N_1$ and $N_2$. When the concentration gradient is
large, most trajectories are potent, leading to a statistically significant change of the macrostate, whereas when the
concentration gradient is small, most trajectories have little effect on the macrostate.
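The impotent fraction is straightforward to evaluate with the standard error function. The sketch below integrates the Gaussian flux distribution of Eq. (18) over $|J| \le h$ in closed form, with $p = q = 1/2$ and $h = \sqrt{N/4}$ as in the text; the total $N = 100$ and the function name `impotent_fraction` are assumptions made for illustration.

```python
from math import erf, sqrt

def impotent_fraction(N1, N2, p, h):
    """Gaussian estimate of the fraction of trajectories with |J| <= h."""
    N, q = N1 + N2, 1 - p
    mu = (N1 - N2) * p            # mean flux
    s = sqrt(2 * N * p * q)       # sqrt(2) times the standard deviation
    return 0.5 * (erf((h + mu) / s) + erf((h - mu) / s))

N = 100
h = sqrt(N / 4)                   # one standard deviation for p = q = 1/2

f_equilibrium = impotent_fraction(50, 50, 0.5, h)    # no gradient
f_far = impotent_fraction(100, 0, 0.5, h)            # steepest gradient
print(round(f_equilibrium, 4), f_far)
```

With no gradient, about 68 percent of trajectories fall within one standard deviation of zero flux and are impotent; with all fleas on one dog, essentially none are.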

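As another measure of fluctuations, let us now consider the "bad actors" (see the corresponding figure). If the average
flux is in the direction from dog (1) to dog (2), what is the probability you will observe flux in the opposite direction
(bad actors)? Using Eq. (18) for $P(J)$, we get, for $N_1 > N_2$,

$$\Phi_{\text{badactors}} = \int_{-\infty}^{0} \frac{dJ}{\sqrt{2\pi Npq}} \exp\left[ -\frac{(J - (N_1 - N_2)p)^2}{2Npq} \right] \tag{23}$$

$$= \frac{1}{2} \left[ 1 - \mathrm{erf}\left( \frac{(N_1 - N_2)p}{\sqrt{2Npq}} \right) \right], \tag{24}$$

which amounts to summing up the fraction of trajectories for which $J < 0$. For $N_2 > N_1$ the mean flux is to the left, the
bad actors are the trajectories with $J > 0$, and the same expression holds with $N_1 - N_2$ replaced by $N_2 - N_1$. The
corresponding figure shows the fraction of bad actors for $p = q = 1/2$. Bad actors are rare when the concentration
gradient is large, and most common when the gradient is small. The discontinuity in the slope of the curve in that figure
at $N_1/N = 0.5$ is a reflection of the fact that the mean flux abruptly changes sign at that value.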
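The bad-actor fraction can be evaluated in the same way. This is a sketch under one stated assumption: the absolute value $|N_1 - N_2|$ is used so that a single function covers both signs of the gradient, which is consistent with the kink at $N_1/N = 0.5$ described above. The function name and the values of $N_1$, $N_2$ are illustrative.

```python
from math import erf, sqrt

def bad_actor_fraction(N1, N2, p):
    """Gaussian estimate of the fraction of trajectories with flux opposite
    to the mean flux (assumes |N1 - N2| handles either sign of the gradient)."""
    N, q = N1 + N2, 1 - p
    return 0.5 * (1 - erf(abs(N1 - N2) * p / sqrt(2 * N * p * q)))

for N1 in (50, 55, 60, 80):
    print(N1, round(bad_actor_fraction(N1, 100 - N1, 0.5), 5))
```

With no gradient, half of the trajectories run either way; for $N_1 = 60$, $N_2 = 40$ the mean flux is two standard deviations from zero and the bad-actor fraction drops to about 2.3 percent.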



IV. FOURIER'S LAW OF HEAT FLOW 

While particle flow is driven by concentration gradients, according to Fick's law, $\langle J \rangle = -D \, d\langle c \rangle/dx$, energy flow
is driven by temperature gradients, according to Fourier's law:

$$J_q = -\kappa \frac{dT}{dx}.$$

Here, $J_q$ is the energy transferred per unit time and per unit cross-sectional area by heat flow and $dT/dx$ is the
temperature gradient that drives it, indicated here for the one-dimensional case. $\kappa$, the thermal conductivity, plays
the role that the diffusion coefficient plays in Fick's Law.

To explore Fourier's law, we return to the Dog-Flea model as described in Sec. HI. Now, columns (1) and (2) can 
differ not only in their particle numbers, Ni(t) and N2{t), but also in their temperatures, Ti(t) and T2{t). To keep 
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it simple here, we assume that each column is at thermal equilibrium and that each particle that jumps carries with 
it the average energy, {mv'^ /2) = kBT/2 from the column it left. Within this simple model, all energy is transported 
by hot or cold molecules switching dogs. Although in general, heat can also flow by other mechanisms mediated by 
collisions, for example, our aim here is just the simplest illustration of principle. The average heat flow at time t is 

$$\langle J_q \rangle = \frac{m_1^*}{\Delta t}\left(\frac{k_B T_1}{2}\right) - \frac{m_2^*}{\Delta t}\left(\frac{k_B T_2}{2}\right) = \frac{p k_B}{2 \Delta t}\left[N_1 T_1 - N_2 T_2\right] \qquad (25)$$

where $m_1^*$ and $m_2^*$ are, as defined in Sec. III A, the numbers of particles jumping from each column at time $t$. If the
particle numbers are identical, $N/2 = N_1 = N_2$, then

$$\langle J_q \rangle = -\kappa \frac{\Delta T}{\Delta x}, \qquad \Delta T = T_2 - T_1,$$

which is Fourier's law for the average heat flux, within this two-column model. The model predicts that the thermal
conductivity is $\kappa = (p k_B N \Delta x)/(\Delta t)$, which can be expressed in a more canonical form as $\kappa = p k_B n v_{av} \Delta x$, when
written in terms of the particle density $n = N/\Delta x$ and the average velocity, $v_{av} = \Delta x/\Delta t$. Our simple model gives
the same thermal conductivity as given by the kinetic theory of gases, $\kappa = (1/2) k_B n v_{av} l$, where $l$ is the mean free
path, if $p \Delta x$ in our model corresponds to $l/2$, half the mean free path. Hence, this simple model captures the main
physical features of heat flow, again by appealing to the idea of summing over the weighted microtrajectories available
to the system.
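The average heat flow of Eq. (25) can be checked numerically. The following is a minimal sketch, assuming a single jump probability p for both columns, independent jumps (so that the jump numbers are binomially distributed), and units in which $k_B = 1$ and $\Delta t = 1$. The function names are illustrative and do not come from the paper.

```python
import random


def mean_heat_flow(N1, N2, T1, T2, p, trials=20000, kB=1.0, dt=1.0, seed=0):
    """Monte Carlo estimate of the average heat flow from column 1 to column 2.

    Each flea jumps independently with probability p and carries the
    average energy kB*T/2 of the column it leaves (the model assumption).
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        m1 = sum(rng.random() < p for _ in range(N1))  # jumps out of column 1
        m2 = sum(rng.random() < p for _ in range(N2))  # jumps out of column 2
        total += (m1 * kB * T1 / 2 - m2 * kB * T2 / 2) / dt
    return total / trials


def predicted_heat_flow(N1, N2, T1, T2, p, kB=1.0, dt=1.0):
    """Right-hand side of Eq. (25), using m1* = p*N1 and m2* = p*N2."""
    return p * kB / (2 * dt) * (N1 * T1 - N2 * T2)


if __name__ == "__main__":
    args = (50, 50, 2.0, 1.0, 0.1)  # N1, N2, T1, T2, p (arbitrary test values)
    print("simulated:", mean_heat_flow(*args))
    print("predicted:", predicted_heat_flow(*args))
```

With equal particle numbers, the sign of the result follows the sign of $T_1 - T_2$, so heat flows on average from the hot column to the cold one, while individual trials can deviate from that average.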

V. NEWTONIAN VISCOSITY 

Another phenomenological law of gradient-driven transport is that of Newtonian viscosity,

$$\tau = \eta \frac{dv_y}{dx},$$

where $\tau$ is the shear stress that is applied to a fluid, $dv_y/dx$ is the resultant shear rate, and the proportionality
coefficient, $\eta$, is the viscosity of a Newtonian fluid. Whereas Fick's law describes particle transport and Fourier's law
describes energy transport, this Newtonian law describes the transport (in the x-direction, from the top moving plate
toward the bottom fixed plate) of linear momentum that acts in the y-direction (parallel to the plates) (see Fig. 8).
Returning to the Dog-Flea model of Sec. III, suppose each particle in column (1) carries momentum $m v_{y1}$ along the
y-axis, and that $m_1$ particles hop from column (1) to (2) at time $t$, carrying with them some linear momentum. As
before, we consider the simplest model that every particle carries the same average momentum from the column it
leaves to its destination column.

The flux, $J_p$, is the amount of y-axis momentum that is transported from one plane to the next in the x-direction,
per unit time and per unit area:

$$\langle J_p \rangle = \frac{m_1^*}{\Delta t}(m v_{y1}) - \frac{m_2^*}{\Delta t}(m v_{y2}) = \frac{p m}{\Delta t}\left[N_1 v_{y1} - N_2 v_{y2}\right].$$

If the number of particles is the same in both columns, $N/2 = N_1 = N_2$, this simplifies to

$$\langle J_p \rangle = -\eta \frac{\Delta v_y}{\Delta x}, \qquad \Delta v_y = v_{y2} - v_{y1},$$

which is the Newtonian law of viscosity in this two-column model. The viscosity is predicted by this model to be
$\eta = (p m N \Delta x)/(\Delta t)$. Converting this to a more canonical form gives $\eta = p m n v_{av} \Delta x$, where $n = N/\Delta x$ is the particle
density, and $v_{av} = \Delta x/\Delta t$ is the average velocity. This is equivalent to the value given by the kinetic theory of gases,
$\eta = (1/3) m n l v_{av}$, if $p \Delta x$ from our model equals $(1/3) l$, one-third of the mean free path length. Note that this simple
model based upon molecular motions will clearly not be applicable to complex fluids where the underlying molecules
possess internal structure.

VI. CHEMICAL KINETICS WITHIN THE DOG-FLEA MODEL 



Chemical kinetics can also be modeled with the Dog-Flea model, if the fleas have a preference for one dog over the
other. Consider the reaction

$$A \underset{k_r}{\overset{k_f}{\rightleftharpoons}} B.$$

The time-dependent average concentrations, $[A](t)$ and $[B](t)$, are often described by chemical rate equations

$$\frac{d[A]}{dt} = -k_f [A] + k_r [B], \qquad \frac{d[B]}{dt} = k_f [A] - k_r [B], \qquad (26)$$

where $k_f$ is the average conversion rate of an A to a B, and $k_r$ is the average conversion rate of a B to an A. These
rate expressions describe only average rates; they do not tell us the distribution of rates. Some A's will convert to B's
faster than the average rate predicts, and some will convert more slowly. Again, we use the Dog-Flea model as
a microscopic model for this process. We use it to consider both the average concentrations and the fluctuations in
concentrations.

Now, dog (1) represents chemical species A and dog (2) represents chemical species B. The net chemical flux from
1 to 2 is given by $J_c = m_1 - m_2$. What differs from our previous situations is that the intrinsic jump rate from
column 1 (species A), $p_1$, is now different from the jump rate from column 2 (species B), $p_2$: fleas escape from dog (1)
at rate $p_1$ and from dog (2) at rate $p_2$. This simply reflects the fact that a forward rate can differ from a backward
rate in a chemical reaction. Maximizing $W_d$ gives $m_1^* = N_1 p_1$ and $m_2^* = N_2 p_2$, so the average flux (which is almost
the same as the most probable flux because of the approximately symmetric nature of the binomial distribution) at time $t$ is

$$\langle J \rangle = N_1 p_1 - N_2 p_2 \propto k_f [A] - k_r [B],$$

which is just the standard mass-action rate law, expressed in terms of the mean concentrations. The mean values
satisfy detailed balance at equilibrium ($\langle J \rangle = 0 \Rightarrow N_2/N_1 = p_1/p_2 = k_f/k_r$).
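The relaxation toward detailed balance can be illustrated with a short simulation. This is a sketch under stated assumptions: each flea on dog (1) jumps independently with probability $p_1$ per time step and each flea on dog (2) with probability $p_2$; the function names and the parameter values in the usage example are chosen here for illustration only.

```python
import random


def step(N1, N2, p1, p2, rng):
    """One time step: m1 fleas leave dog 1 (A -> B), m2 leave dog 2 (B -> A)."""
    m1 = sum(rng.random() < p1 for _ in range(N1))
    m2 = sum(rng.random() < p2 for _ in range(N2))
    return N1 - m1 + m2, N2 + m1 - m2


def simulate(N1, N2, p1, p2, steps, seed=0):
    """Stochastic trajectory of the occupancies (N1, N2)."""
    rng = random.Random(seed)
    history = [(N1, N2)]
    for _ in range(steps):
        N1, N2 = step(N1, N2, p1, p2, rng)
        history.append((N1, N2))
    return history


def mean_field(N1, N2, p1, p2, steps):
    """Deterministic evolution of the averages using <J> = N1*p1 - N2*p2."""
    for _ in range(steps):
        flux = N1 * p1 - N2 * p2
        N1, N2 = N1 - flux, N2 + flux
    return N1, N2


if __name__ == "__main__":
    p1, p2 = 0.1, 0.2
    A, B = mean_field(100.0, 0.0, p1, p2, 200)
    print("mean-field N2/N1:", B / A, " p1/p2:", p1 / p2)
    tail = [n1 for n1, _ in simulate(100, 0, p1, p2, 3000)[1000:]]
    print("time-averaged N1:", sum(tail) / len(tail))
```

The mean-field ratio $N_2/N_1$ settles at $p_1/p_2$, while the stochastic trajectory keeps fluctuating around the corresponding occupancy, which is the behavior examined next.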

More interesting than the well-known behavior of the mean chemical reaction rate is the fluctuational behavior.
For example, if the number of particles is small, then even when $k_f [A] - k_r [B] > 0$, indicating an average conversion
of A's to B's, the reverse can happen occasionally instead. When will these fluctuations be large? As in Sec. III, we
first determine the probability distribution of the flux $J$. In this case, the probability distribution becomes:

$$P(J) = \frac{1}{\sqrt{2\pi (p_1 q_1 N_1 + p_2 q_2 N_2)}} \exp\left[-\frac{\left(J - (N_1 p_1 - N_2 p_2)\right)^2}{2 (p_1 q_1 N_1 + p_2 q_2 N_2)}\right]. \qquad (27)$$

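Again, let us use this flux distribution function to consider the fluctuations in the chemical reaction. The relative
fluctuation in the flux (the square root of the relative variance) is

$$\sqrt{\frac{\langle (\Delta J)^2 \rangle}{\langle J \rangle^2}} = \frac{\sqrt{N_1 p_1 q_1 + N_2 p_2 q_2}}{N_1 p_1 - N_2 p_2}.$$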

As before, the main message is that when the system is not yet at equilibrium (i.e., the denominator is non-zero),
macroscopically large systems will have negligibly small fluctuations. The relative magnitude of fluctuations scales
approximately as $N^{-1/2}$. Let us also look at the potencies of microtrajectories as another window into fluctuations.
Using Eq. (22) with $p_1$ and $p_2$ gives the fraction of trajectories that are impotent as

$$\Phi_{\rm impotent} = \int_{-h}^{h} dJ\, \frac{1}{\sqrt{2\pi (N_1 p_1 q_1 + N_2 p_2 q_2)}} \exp\left[-\frac{\left(J - (N_1 p_1 - N_2 p_2)\right)^2}{2 (N_1 p_1 q_1 + N_2 p_2 q_2)}\right] \qquad (28)$$

$$= \frac{1}{2}\left[{\rm erf}\left(\frac{h + (N_1 p_1 - N_2 p_2)}{\sqrt{2 (N_1 p_1 q_1 + N_2 p_2 q_2)}}\right) + {\rm erf}\left(\frac{h - (N_1 p_1 - N_2 p_2)}{\sqrt{2 (N_1 p_1 q_1 + N_2 p_2 q_2)}}\right)\right]. \qquad (29)$$

Using $N_1 + N_2 = N = 100$, $p_1 = 0.1$, and $p_2 = 0.2$, $\Phi_{\rm potent} = 1 - \Phi_{\rm impotent}$ as a function of $N_1/N$ is shown in
Fig. 9.
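The potency curve and the $N^{-1/2}$ scaling can be evaluated directly from the Gaussian flux distribution. The sketch below assumes $q_i = 1 - p_i$ and takes the impotence window half-width as $h = 1$; that value is an assumption made here for illustration, and the function names are hypothetical.

```python
from math import erf, sqrt


def flux_mean_var(N1, N2, p1, p2):
    """Mean and variance of the Gaussian flux distribution, with q_i = 1 - p_i."""
    mean = N1 * p1 - N2 * p2
    var = N1 * p1 * (1 - p1) + N2 * p2 * (1 - p2)
    return mean, var


def impotent_fraction(N1, N2, p1, p2, h):
    """Integral of the Gaussian P(J) over -h <= J <= h, written with erf."""
    mean, var = flux_mean_var(N1, N2, p1, p2)
    s = sqrt(2 * var)
    return 0.5 * (erf((h + mean) / s) + erf((h - mean) / s))


def potent_curve(N, p1, p2, h):
    """List of (N1/N, potent fraction) for N1 = 0, 1, ..., N."""
    return [(N1 / N, 1 - impotent_fraction(N1, N - N1, p1, p2, h))
            for N1 in range(N + 1)]


def relative_fluctuation(N1, N2, p1, p2):
    """sqrt(variance) / mean of the flux; requires a nonzero mean flux."""
    mean, var = flux_mean_var(N1, N2, p1, p2)
    return sqrt(var) / mean


if __name__ == "__main__":
    curve = potent_curve(100, 0.1, 0.2, h=1.0)  # h = 1 is an assumed value
    x_min, phi_min = min(curve, key=lambda point: point[1])
    print("potency is lowest at N1/N =", x_min)
    # Multiplying both occupancies by 4 halves the relative fluctuation.
    print(relative_fluctuation(320, 80, 0.1, 0.2) / relative_fluctuation(80, 20, 0.1, 0.2))
```

The minimum of the potent fraction falls close to $N_1/N = p_2/(p_1 + p_2)$, where the mean flux vanishes.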



VII. DERIVING THE DYNAMICAL DISTRIBUTION FUNCTION FROM MAXIMUM CALIBER 

Throughout this paper, we have used the binomial distribution function, $W_d$, as the basis for our treatment of
stochastic dynamics. The Maximum Caliber idea says that if we find the value of $W_d$ that is maximal with respect to
the microscopic trajectories, this will give the macroscopically observable flux. Here, we now restate this in a more
general way, and in terms of the probabilities of the trajectories.

Let $P(i)$ be the probability of a microtrajectory $i$ during the interval from time $t$ to $t + \Delta t$. A microtrajectory is
a specific set of fleas that jump; for example, microtrajectory $i = 27$ might be the situation in which fleas number 4,
8, and 23 jump from dog (1) to (2). We take as a constraint the average number of jumps, $\langle m \rangle$, the macroscopic
observable. The quantity $m_i = 3$ in this case indicates that trajectory $i$ involves 3 fleas jumping. We express the
caliber $C$ as

$$C = -\sum_i P(i) \ln P(i) - \lambda \sum_i m_i P(i) - \alpha \sum_i P(i), \qquad (30)$$

where $\lambda$ is the Lagrange multiplier that enforces the constraint of the average flux and $\alpha$ is the Lagrange multiplier
that enforces the normalization condition that the $P(i)$'s sum to one. Maximizing the caliber gives the populations
of the microtrajectories,

$$P(i) = \exp(-\alpha - \lambda m_i). \qquad (31)$$
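The variation that leads from the caliber to Eq. (31) can be written out explicitly. The following is a sketch, assuming the caliber has the entropy-like form $-\sum_i P(i) \ln P(i)$ together with the two constraint terms described above, and treating each $P(i)$ as an independent variable. The shifted multiplier $\alpha'$ is introduced here only for bookkeeping: the constant produced by differentiating $P \ln P$ is absorbed into $\alpha$.

```latex
\begin{align}
C &= -\sum_i P(i) \ln P(i) - \lambda \sum_i m_i P(i) - \alpha' \sum_i P(i), \\
\frac{\partial C}{\partial P(i)} &= -\ln P(i) - 1 - \lambda m_i - \alpha' = 0, \\
P(i) &= \exp\left[-(1 + \alpha') - \lambda m_i\right]
      = \exp(-\alpha - \lambda m_i), \qquad \alpha \equiv 1 + \alpha' .
\end{align}
```

Because the second derivative, $-1/P(i)$, is negative for every $i$, this stationary point is a maximum.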

Note that the probability $P(i)$ of the $i$th trajectory depends only on the total number $m_i$ of the jumping fleas. Also,
all trajectories with the same $m_i$ have the same probability. Now, in the same way that it is sometimes useful in
equilibrium statistical mechanics to switch from microstates to energy levels, we now express the population $P(i)$ of a
given microtrajectory, instead, in terms of $\rho(m)$, the fraction of all the microtrajectories that involve $m$ jumps during
this time interval,

$$\rho(m) = g(m) Q(m), \qquad (32)$$

where $g(m) = N!/[m!(N - m)!]$ is the "density of trajectories" with flux $m$ (in analogy with the density of states
for equilibrium systems), and $Q(m)$ is the probability $P(i)$ of the microtrajectory $i$ with $m_i = m$. In other words, $i$
denotes a microtrajectory (specific set of fleas jumping) while $m$ denotes a microprocess (the number of fleas jumping).
The total number of $i$'s associated with a given $m$ is precisely $g(m)$. It can also be easily seen that

$$\sum_i P(i) = \sum_{m=0}^{N} g(m) Q(m) = \sum_{m=0}^{N} \rho(m) = 1, \qquad (33)$$

$$\langle m \rangle = \sum_i m_i P(i) = \sum_{m=0}^{N} m\, g(m) Q(m) = \sum_{m=0}^{N} m\, \rho(m). \qquad (34)$$

Thus, the distribution of jump-processes written in terms of the jump number, $m$, is

$$\rho(m) = \frac{N!}{m!(N - m)!} \exp(-\alpha) \exp(-\lambda m). \qquad (35)$$

The Lagrange multiplier $\alpha$ can be eliminated by summing over all trajectories and requiring that $\sum_{m=0}^{N} \rho(m) = 1$,
i.e.,

$$\exp(\alpha) = \sum_{m=0}^{N} g(m) e^{-\lambda m} = \sum_{m=0}^{N} \frac{N!}{m!(N - m)!} e^{-\lambda m} = \left(1 + e^{-\lambda}\right)^N. \qquad (36)$$



Combining Eqs. (35) and (36) gives

$$\rho(m) = \frac{N!}{m!(N - m)!}\, \frac{\exp(-\lambda m)}{(1 + \exp(-\lambda))^N}. \qquad (37)$$

If we now let

$$p = \frac{\exp(-\lambda)}{1 + \exp(-\lambda)}, \qquad (38)$$

then we get

$$p^m = \frac{\exp(-\lambda m)}{(1 + \exp(-\lambda))^m} \qquad (39)$$

and

$$(1 - p)^{N - m} = \frac{1}{(1 + \exp(-\lambda))^{N - m}}. \qquad (40)$$

Combining Eqs. (37), (39), and (40) gives the simple form

$$\rho(m) = \frac{N!}{m!(N - m)!}\, p^m (1 - p)^{N - m}$$

that we have used throughout this paper in Eq. (11).
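The equivalence between the Maximum Caliber distribution of Eq. (37) and the binomial form can be verified numerically. This is a minimal sketch; the function names are illustrative, and `lam_from_p` is simply the algebraic inverse of Eq. (38), $\lambda = \ln[(1 - p)/p]$.

```python
from math import comb, exp, log


def caliber_distribution(N, lam):
    """rho(m) = g(m) exp(-lam*m) / (1 + exp(-lam))**N for m = 0..N, as in Eq. (37)."""
    norm = (1 + exp(-lam)) ** N  # exp(alpha), Eq. (36)
    return [comb(N, m) * exp(-lam * m) / norm for m in range(N + 1)]


def binomial_distribution(N, p):
    """Binomial probabilities g(m) p**m (1-p)**(N-m) for m = 0..N."""
    return [comb(N, m) * p ** m * (1 - p) ** (N - m) for m in range(N + 1)]


def lam_from_p(p):
    """Invert Eq. (38): p = exp(-lam) / (1 + exp(-lam))."""
    return log((1 - p) / p)


if __name__ == "__main__":
    N, p = 20, 0.3  # arbitrary illustrative values
    rho = caliber_distribution(N, lam_from_p(p))
    binom = binomial_distribution(N, p)
    print("largest difference:", max(abs(a - b) for a, b in zip(rho, binom)))
    print("average number of jumps:", sum(m * r for m, r in enumerate(rho)))
```

The Lagrange multiplier $\lambda$ is thus fixed by the jump probability $p$, or equivalently by the constrained average $\langle m \rangle = Np$.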



VIII. SUMMARY AND COMMENTS 



We have shown how to derive the phenomenological laws of nonequilibrium transport, including Fick's law of
diffusion, the Fourier law of heat conduction, the Newtonian law of viscosity, and mass-action laws of chemical
kinetics, from a simple physical foundation that can be readily taught in elementary courses. We use the Dog-Flea
model, originated by the Ehrenfests, for describing how particles, energy, or momentum can be transported across a
plane. We combine that model with the Principle of Maximum Caliber, a dynamical analog of the way the Principle
of Maximum Entropy is used to derive the laws of equilibrium. In particular, according to Maximum Entropy, you
maximize the entropy $S(p_1, p_2, \ldots, p_N)$ with respect to the probabilities $p_i$ of $N$ microstates, subject to constraints,
such as the requirement that the average energy is known. That gives the Boltzmann distribution law. Here, for 
dynamics, we focus on microtrajectories, rather than microstates, and we maximize a dynamical entropy-like quantity, 
subject to an average flux constraint. In this way, maximizing the caliber is the dynamical equivalent of minimizing a 
free energy for predicting equilibria. A particular value of this approach is that it also gives us fluctuation information, 
not just averages. In diffusion, for example, sometimes the flux can be a little higher or lower than the average value 
expected from Fick's Law. These fluctuations can be particularly important for biology and nanotechnology, where 
the numbers of particles can be very small, and therefore where there can be significant fluctuations in rates, around 
the average.

FIG. 1: Colloidal free expansion setup to illustrate diffusion involving small numbers of particles. (a) Schematic of experimental
setup (see text for details). (b) Several snapshots from the experiment. (c) Normalized histogram of particle positions during
the experiment. The solution to the diffusion equation for the microfluidic "free expansion" experiment is superposed for
comparison.

FIG. 2: Schematic of the simple dog-flea model. (a) State of the system at time $t$, (b) a particular microtrajectory in which
two fleas jump from the dog on the left and one jumps from the dog on the right, (c) occupancies of the dogs at time $t + \Delta t$.

FIG. 3: Schematic of the distribution of fluxes for different time points as the system approaches equilibrium.

FIG. 4: Schematic of which trajectories are potent and which are impotent. The shaded region corresponds to the impotent
trajectories for which $m_1$ and $m_2$ are either equal or approximately equal and hence make relatively small change in the
macrostate. The unshaded region corresponds to potent trajectories.

FIG. 5: Illustration of the potency of the microtrajectories associated with different distributions of $N$ particles on the two
dogs. The total number of particles $N_1 + N_2 = N = 100$.

FIG. 6: Illustration of the notion of bad actors. Bad actors are essentially the microtrajectories which contribute net particle
motion which has the opposite sign from the macroflux.

FIG. 7: The fraction of all possible trajectories that go against the direction of the macroflux, for $N = 100$. The fraction of
bad actors is highest at $N_1 = N/2 = 50$.





FIG. 8: Illustration of Newton's law of viscosity. The fluid is sheared with a constant stress. The fluid velocity decreases
continuously from its maximum value at the top of the fluid to zero at the bottom. There is thus a gradient in the velocity
which can be related to the shear stress in the fluid.

FIG. 9: The fraction of potent trajectories, $\Phi_{\rm potent}$, as a function of $N_1/N$ for $N_1 + N_2 = N = 100$, for unequal jump
rates $p_1 = 0.1$ and $p_2 = 0.2$. The minimum value of potency does not occur at $N_1/N = 0.5$, but at $N_1/N \approx 0.66$. This
value of $N_1/N$ also corresponds to its equilibrium value given by $p_2/(p_1 + p_2)$.

TABLE I: Trajectory multiplicity table for the specific case where $N_1(t) = 4$ and $N_2(t) = 2$. Each entry in the table corresponds
to the total number of trajectories for the particular values of $m_1$ and $m_2$.

TABLE II: Trajectory multiplicity table for the specific case $N_1(t) = 6$ and $N_2(t) = 0$ when the system is far from macroscopic
equilibrium.



TABLE III: Trajectory multiplicity table for the specific case $N_1(t) = 3$ and $N_2(t) = 3$ when the system is at the macroscopic
equilibrium.

m1 \ m2     0    1    2    3
   0        1    3    3    1
   1        3    9    9    3
   2        3    9    9    3
   3        1    3    3    1
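The entries of the trajectory multiplicity tables can be regenerated for any occupancies. The sketch below assumes that each entry is the product of two binomial coefficients, $\binom{N_1}{m_1}\binom{N_2}{m_2}$, which is the number of ways to choose which $m_1$ fleas leave dog (1) and which $m_2$ fleas leave dog (2). This form reproduces the surviving entries of Table III; the function name is illustrative.

```python
from math import comb


def multiplicity_table(N1, N2):
    """Rows are indexed by m1 = 0..N1 and columns by m2 = 0..N2."""
    return [[comb(N1, m1) * comb(N2, m2) for m2 in range(N2 + 1)]
            for m1 in range(N1 + 1)]


if __name__ == "__main__":
    # Occupancies quoted in the captions of Tables I, II, and III.
    for N1, N2 in [(4, 2), (6, 0), (3, 3)]:
        print(f"N1 = {N1}, N2 = {N2}")
        for row in multiplicity_table(N1, N2):
            print("  ", row)
```

In each case the entries sum to $2^{N_1 + N_2}$, the total number of microtrajectories available in one time step.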



